Sparse, Contextually Informed Models for Irony Detection: Exploiting User Communities, Entities and Sentiment
Authors
Abstract
Automatically detecting verbal irony (roughly, sarcasm) in online content is important for many practical applications (e.g., sentiment detection), but it is difficult. Previous approaches have relied predominantly on signal gleaned from word counts and grammatical cues. But such approaches fail to exploit the context in which comments are embedded. We thus propose a novel strategy for verbal irony classification that exploits contextual features, specifically by combining noun phrases and sentiment extracted from comments with the forum type (e.g., conservative or liberal) to which they were posted. We show that this approach improves verbal irony classification performance. Furthermore, because this method generates a very large feature space (and we expect predictive contextual features to be strong but few), we propose a mixed regularization strategy that places a sparsity-inducing ℓ1 penalty on the contextual feature weights on top of the ℓ2 penalty applied to all model coefficients. This increases model sparsity and reduces the variance of model performance.
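The mixed regularization idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the loss function, the optimizer, or how features are encoded, so everything below is an assumption made for illustration: a logistic loss, proximal gradient descent, and the hypothetical helper names `interaction_features`, `soft_threshold` and `fit_mixed_penalty`. The ℓ2 penalty enters the gradient for every weight, while the ℓ1 penalty is applied by soft-thresholding only the weights flagged as contextual.

```python
import numpy as np


def interaction_features(noun_phrases, sentiment, forum):
    """Hypothetical contextual features: forum type x noun phrase x sentiment.

    The string encoding is an assumption; the abstract only says that the
    three signals are combined.
    """
    return [f"{forum}|{phrase.lower()}|{sentiment}" for phrase in noun_phrases]


def soft_threshold(w, t):
    """Proximal operator of t * ||w||_1: shrink toward zero, clip at zero."""
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)


def fit_mixed_penalty(X, y, ctx_mask, l1=0.1, l2=0.1, lr=0.1, n_iter=500):
    """Logistic regression (assumed loss) with a mixed penalty.

    Minimizes: mean log-loss + (l2 / 2) * ||w||^2 + l1 * ||w[ctx_mask]||_1

    X        : (n, d) feature matrix
    y        : (n,) labels in {0, 1}
    ctx_mask : (d,) boolean array, True for contextual feature columns
    """
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))      # predicted probabilities
        grad = X.T @ (p - y) / n + l2 * w       # smooth part: loss + l2 on all weights
        w = w - lr * grad                       # gradient step
        w[ctx_mask] = soft_threshold(w[ctx_mask], lr * l1)  # l1 on contextual weights only
    return w
```

With a large enough `l1`, contextual weights that carry little signal are driven exactly to zero while the remaining weights are only shrunk by the ℓ2 term, which is the sparsity behaviour the abstract describes.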
Similar resources
Overlapping Community Detection in Social Networks Based on Stochastic Simulation
Community detection is a task of fundamental importance in social network analysis. Community structures enable us to discover the hidden interactions among the network entities and summarize the network information, which can be applied in many domains such as bioinformatics, finance, e-commerce and forensic science. There exist a variety of methods for community detection based on diffe...
An Improved Method for Detection of Satire from User-Generated Content
Sarcasm is a form of speech act in which the speakers convey their message in an implicit way. It is a sophisticated form of speech act widely used in online communities. The inherently ambiguous nature of sarcasm sometimes makes it hard even for humans to decide whether an utterance is sarcastic in nature or not. Recognition of sarcasm may anticipate benefits in many sentiment analysis of NLP ...
Exploring the Realization of Irony in Twitter Data
Cynthia Van Hee, Els Lefever and Véronique Hoste. LT3, Language and Translation Technology Team, Ghent University, Groot-Brittanniëlaan 45, 9000 Ghent, Belgium. Abstract: Handling figurative language like irony is currently a challenging task in natural language processing. Since irony is commonly used in user-generated content, its presence can ...
A multidimensional approach for detecting irony in Twitter
Irony is a pervasive aspect of many online texts, one made all the more difficult by the absence of face-to-face contact and vocal intonation. As our media increasingly become more social, the problem of irony detection will become even more pressing. We describe here a set of textual features for recognizing irony at a linguistic level, especially in short texts created via social media such a...
Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates
We investigate the expression of opinions about human entities in user-generated content (UGC). A set of 2,800 online news comments (8,000 sentences) was manually annotated, following a rich annotation scheme designed for this purpose. We conclude that the challenge in performing opinion mining in such type of content is correctly identifying the positive opinions, because (i) they are much les...